Dataset statistics
| Number of variables | 40 |
|---|---|
| Number of observations | 3382812 |
| Missing cells | 66696504 |
| Missing cells (%) | 49.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.0 GiB |
| Average record size in memory | 320.0 B |
Variable types
| Categorical | 16 |
|---|---|
| Numeric | 18 |
| Unsupported | 6 |
id_mutation has a high cardinality: 1447220 distinct values | High cardinality |
date_mutation has a high cardinality: 365 distinct values | High cardinality |
adresse_nom_voie has a high cardinality: 478212 distinct values | High cardinality |
adresse_code_voie has a high cardinality: 15865 distinct values | High cardinality |
nom_commune has a high cardinality: 30549 distinct values | High cardinality |
ancien_nom_commune has a high cardinality: 782 distinct values | High cardinality |
id_parcelle has a high cardinality: 2029416 distinct values | High cardinality |
ancien_id_parcelle has a high cardinality: 16409 distinct values | High cardinality |
code_nature_culture_speciale has a high cardinality: 125 distinct values | High cardinality |
nature_culture_speciale has a high cardinality: 125 distinct values | High cardinality |
code_postal is highly correlated with ancien_code_commune | High correlation |
ancien_code_commune is highly correlated with code_postal | High correlation |
lot1_surface_carrez is highly correlated with lot4_surface_carrez and 4 other fields | High correlation |
lot2_surface_carrez is highly correlated with lot3_surface_carrez and 4 other fields | High correlation |
lot3_surface_carrez is highly correlated with lot2_surface_carrez and 3 other fields | High correlation |
lot4_numero is highly correlated with lot5_numero | High correlation |
lot4_surface_carrez is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
lot5_numero is highly correlated with lot4_numero | High correlation |
lot5_surface_carrez is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
code_type_local is highly correlated with nombre_pieces_principales | High correlation |
surface_reelle_bati is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
nombre_pieces_principales is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
surface_terrain is highly correlated with lot1_surface_carrez | High correlation |
valeur_fonciere is highly correlated with lot4_surface_carrez | High correlation |
code_postal is highly correlated with ancien_code_commune | High correlation |
ancien_code_commune is highly correlated with code_postal | High correlation |
lot1_surface_carrez is highly correlated with lot5_surface_carrez | High correlation |
lot2_surface_carrez is highly correlated with lot3_surface_carrez and 2 other fields | High correlation |
lot3_surface_carrez is highly correlated with lot2_surface_carrez and 2 other fields | High correlation |
lot4_numero is highly correlated with lot5_numero | High correlation |
lot4_surface_carrez is highly correlated with valeur_fonciere and 3 other fields | High correlation |
lot5_numero is highly correlated with lot4_numero | High correlation |
lot5_surface_carrez is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
code_type_local is highly correlated with nombre_pieces_principales | High correlation |
nombre_pieces_principales is highly correlated with code_type_local | High correlation |
code_postal is highly correlated with ancien_code_commune | High correlation |
ancien_code_commune is highly correlated with code_postal | High correlation |
lot1_surface_carrez is highly correlated with surface_reelle_bati and 1 other fields | High correlation |
lot2_surface_carrez is highly correlated with lot4_surface_carrez and 2 other fields | High correlation |
lot3_surface_carrez is highly correlated with lot4_surface_carrez and 2 other fields | High correlation |
lot4_numero is highly correlated with lot5_numero | High correlation |
lot4_surface_carrez is highly correlated with lot2_surface_carrez and 2 other fields | High correlation |
lot5_numero is highly correlated with lot4_numero | High correlation |
lot5_surface_carrez is highly correlated with lot3_surface_carrez and 1 other fields | High correlation |
code_type_local is highly correlated with nombre_pieces_principales | High correlation |
surface_reelle_bati is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
nombre_pieces_principales is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
code_nature_culture is highly correlated with nature_culture | High correlation |
nature_culture is highly correlated with code_nature_culture | High correlation |
code_type_local is highly correlated with type_local | High correlation |
type_local is highly correlated with code_type_local | High correlation |
adresse_numero is highly correlated with adresse_suffixe | High correlation |
adresse_suffixe is highly correlated with adresse_numero and 3 other fields | High correlation |
code_postal is highly correlated with ancien_code_commune and 1 other fields | High correlation |
ancien_code_commune is highly correlated with adresse_suffixe and 2 other fields | High correlation |
lot1_surface_carrez is highly correlated with lot4_surface_carrez and 1 other fields | High correlation |
lot2_surface_carrez is highly correlated with lot3_surface_carrez and 2 other fields | High correlation |
lot3_surface_carrez is highly correlated with lot2_surface_carrez and 2 other fields | High correlation |
lot4_numero is highly correlated with lot5_numero | High correlation |
lot4_surface_carrez is highly correlated with lot1_surface_carrez and 4 other fields | High correlation |
lot5_numero is highly correlated with lot4_numero | High correlation |
lot5_surface_carrez is highly correlated with ancien_code_commune and 4 other fields | High correlation |
code_type_local is highly correlated with type_local | High correlation |
type_local is highly correlated with code_type_local | High correlation |
code_nature_culture is highly correlated with adresse_suffixe and 1 other fields | High correlation |
nature_culture is highly correlated with adresse_suffixe and 1 other fields | High correlation |
longitude is highly correlated with code_postal and 2 other fields | High correlation |
latitude is highly correlated with longitude | High correlation |
valeur_fonciere has 45338 (1.3%) missing values | Missing |
adresse_numero has 1395778 (41.3%) missing values | Missing |
adresse_suffixe has 3237253 (95.7%) missing values | Missing |
ancien_code_commune has 3323835 (98.3%) missing values | Missing |
ancien_nom_commune has 3323835 (98.3%) missing values | Missing |
ancien_id_parcelle has 3361806 (99.4%) missing values | Missing |
numero_volume has 3373972 (99.7%) missing values | Missing |
lot1_numero has 2313128 (68.4%) missing values | Missing |
lot1_surface_carrez has 3104314 (91.8%) missing values | Missing |
lot2_numero has 3161194 (93.4%) missing values | Missing |
lot2_surface_carrez has 3314519 (98.0%) missing values | Missing |
lot3_numero has 3346137 (98.9%) missing values | Missing |
lot3_surface_carrez has 3376163 (99.8%) missing values | Missing |
lot4_numero has 3370210 (99.6%) missing values | Missing |
lot4_surface_carrez has 3381060 (99.9%) missing values | Missing |
lot5_numero has 3376821 (99.8%) missing values | Missing |
lot5_surface_carrez has 3382060 (> 99.9%) missing values | Missing |
code_type_local has 1508037 (44.6%) missing values | Missing |
type_local has 1508037 (44.6%) missing values | Missing |
surface_reelle_bati has 1992066 (58.9%) missing values | Missing |
nombre_pieces_principales has 1510700 (44.7%) missing values | Missing |
code_nature_culture has 1079606 (31.9%) missing values | Missing |
nature_culture has 1079606 (31.9%) missing values | Missing |
code_nature_culture_speciale has 3230429 (95.5%) missing values | Missing |
nature_culture_speciale has 3230429 (95.5%) missing values | Missing |
surface_terrain has 1079656 (31.9%) missing values | Missing |
longitude has 99869 (3.0%) missing values | Missing |
latitude has 99869 (3.0%) missing values | Missing |
numero_disposition is highly skewed (γ1 = 43.05693677) | Skewed |
lot1_surface_carrez is highly skewed (γ1 = 27.3471126) | Skewed |
lot2_surface_carrez is highly skewed (γ1 = 45.2560629) | Skewed |
lot4_numero is highly skewed (γ1 = 57.55405177) | Skewed |
lot5_numero is highly skewed (γ1 = 48.48277988) | Skewed |
nombre_lots is highly skewed (γ1 = 33.01288703) | Skewed |
surface_reelle_bati is highly skewed (γ1 = 305.1601927) | Skewed |
surface_terrain is highly skewed (γ1 = 25.42368337) | Skewed |
code_commune is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
code_departement is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
numero_volume is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lot1_numero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lot2_numero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lot3_numero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
nombre_lots has 2313128 (68.4%) zeros | Zeros |
nombre_pieces_principales has 603218 (17.8%) zeros | Zeros |
Reproduction
| Analysis started | 2021-10-05 22:49:21.818662 |
|---|---|
| Analysis finished | 2021-10-05 23:05:29.977323 |
| Duration | 16 minutes and 8.16 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 1447220 |
|---|---|
| Distinct (%) | 42.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.8 MiB |
| 2017-450452 | 5450 |
|---|---|
| 2017-1310773 | 4580 |
| 2017-1268155 | 4146 |
| 2017-1345691 | 3318 |
| 2017-1327546 | 2499 |
| Other values (1447215) |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 11.20729647 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 698538 ? |
|---|---|
| Unique (%) | 20.6% |
Sample
| 1st row | 2017-1 |
|---|---|
| 2nd row | 2017-2 |
| 3rd row | 2017-3 |
| 4th row | 2017-3 |
| 5th row | 2017-3 |
Common Values
| Value | Count | Frequency (%) |
| 2017-450452 | 5450 | 0.2% |
| 2017-1310773 | 4580 | 0.1% |
| 2017-1268155 | 4146 | 0.1% |
| 2017-1345691 | 3318 | 0.1% |
| 2017-1327546 | 2499 | 0.1% |
| 2017-443526 | 1872 | 0.1% |
| 2017-823869 | 1858 | 0.1% |
| 2017-1345835 | 1756 | 0.1% |
| 2017-1308069 | 1546 | < 0.1% |
| 2017-823877 | 1428 | < 0.1% |
| Other values (1447210) | 3354359 |
Length
| Value | Count | Frequency (%) |
| 2017-450452 | 5450 | 0.2% |
| 2017-1310773 | 4580 | 0.1% |
| 2017-1268155 | 4146 | 0.1% |
| 2017-1345691 | 3318 | 0.1% |
| 2017-1327546 | 2499 | 0.1% |
| 2017-443526 | 1872 | 0.1% |
| 2017-823869 | 1858 | 0.1% |
| 2017-1345835 | 1756 | 0.1% |
| 2017-1308069 | 1546 | < 0.1% |
| 2017-823877 | 1428 | < 0.1% |
| Other values (1447210) | 3354359 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 365 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.8 MiB |
| 2017-04-27 | 59570 |
|---|---|
| 2017-12-29 | 38435 |
| 2017-12-21 | 34643 |
| 2017-12-28 | 32836 |
| 2017-12-22 | 32393 |
| Other values (360) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2017-01-02 |
|---|---|
| 2nd row | 2017-01-05 |
| 3rd row | 2017-01-06 |
| 4th row | 2017-01-06 |
| 5th row | 2017-01-06 |
Common Values
| Value | Count | Frequency (%) |
| 2017-04-27 | 59570 | 1.8% |
| 2017-12-29 | 38435 | 1.1% |
| 2017-12-21 | 34643 | 1.0% |
| 2017-12-28 | 32836 | 1.0% |
| 2017-12-22 | 32393 | 1.0% |
| 2017-06-30 | 29222 | 0.9% |
| 2017-09-29 | 25850 | 0.8% |
| 2017-12-20 | 24964 | 0.7% |
| 2017-05-23 | 24741 | 0.7% |
| 2017-03-31 | 24718 | 0.7% |
| Other values (355) | 3055440 |
Length
| Value | Count | Frequency (%) |
| 2017-04-27 | 59570 | 1.8% |
| 2017-12-29 | 38435 | 1.1% |
| 2017-12-21 | 34643 | 1.0% |
| 2017-12-28 | 32836 | 1.0% |
| 2017-12-22 | 32393 | 1.0% |
| 2017-06-30 | 29222 | 0.9% |
| 2017-09-29 | 25850 | 0.8% |
| 2017-12-20 | 24964 | 0.7% |
| 2017-05-23 | 24741 | 0.7% |
| 2017-03-31 | 24718 | 0.7% |
| Other values (355) | 3055440 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 183 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.171767453 |
| Minimum | 1 |
|---|---|
| Maximum | 185 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 185 |
| Range | 184 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.061452548 |
|---|---|
| Coefficient of variation (CV) | 1.759267629 |
| Kurtosis | 2567.344373 |
| Mean | 1.171767453 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 43.05693677 |
| Sum | 3963869 |
| Variance | 4.249586607 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3134086 | |
| 2 | 195651 | 5.8% |
| 3 | 29589 | 0.9% |
| 4 | 5989 | 0.2% |
| 8 | 4469 | 0.1% |
| 5 | 2375 | 0.1% |
| 6 | 1380 | < 0.1% |
| 28 | 1281 | < 0.1% |
| 7 | 702 | < 0.1% |
| 32 | 576 | < 0.1% |
| Other values (173) | 6714 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 3134086 | |
| 2 | 195651 | 5.8% |
| 3 | 29589 | 0.9% |
| 4 | 5989 | 0.2% |
| 5 | 2375 | 0.1% |
| 6 | 1380 | < 0.1% |
| 7 | 702 | < 0.1% |
| 8 | 4469 | 0.1% |
| 9 | 279 | < 0.1% |
| 10 | 336 | < 0.1% |
| Value | Count | Frequency (%) |
| 185 | 1 | < 0.1% |
| 184 | 3 | |
| 183 | 1 | < 0.1% |
| 182 | 1 | < 0.1% |
| 181 | 1 | < 0.1% |
| 180 | 1 | < 0.1% |
| 178 | 1 | < 0.1% |
| 177 | 2 | |
| 175 | 2 | |
| 174 | 1 | < 0.1% |
nature_mutation
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.8 MiB |
| Vente | |
|---|---|
| Vente en l'état futur d'achèvement | 272048 |
| Echange | 46317 |
| Vente terrain à bâtir | 15037 |
| Adjudication | 14727 |
Length
| Max length | 34 |
|---|---|
| Median length | 5 |
| Mean length | 7.488210104 |
| Min length | 5 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Vente |
|---|---|
| 2nd row | Vente |
| 3rd row | Vente |
| 4th row | Vente |
| 5th row | Vente |
Common Values
| Value | Count | Frequency (%) |
| Vente | 3023253 | |
| Vente en l'état futur d'achèvement | 272048 | 8.0% |
| Echange | 46317 | 1.4% |
| Vente terrain à bâtir | 15037 | 0.4% |
| Adjudication | 14727 | 0.4% |
| Expropriation | 11430 | 0.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| vente | 3310338 | |
| d'achèvement | 272048 | 6.0% |
| futur | 272048 | 6.0% |
| l'état | 272048 | 6.0% |
| en | 272048 | 6.0% |
| echange | 46317 | 1.0% |
| bâtir | 15037 | 0.3% |
| à | 15037 | 0.3% |
| terrain | 15037 | 0.3% |
| adjudication | 14727 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 131008 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 45338 |
| Missing (%) | 1.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1217839.376 |
| Minimum | 0.01 |
|---|---|
| Maximum | 686496000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 2500 |
| Q1 | 59000 |
| median | 144462 |
| Q3 | 260000 |
| 95-th percentile | 1642412.1 |
| Maximum | 686496000 |
| Range | 686496000 |
| Interquartile range (IQR) | 201000 |
Descriptive statistics
| Standard deviation | 9447002.924 |
|---|---|
| Coefficient of variation (CV) | 7.757183016 |
| Kurtosis | 274.8009997 |
| Mean | 1217839.376 |
| Median Absolute Deviation (MAD) | 95538 |
| Skewness | 14.50600536 |
| Sum | 4.064507254 × 1012 |
| Variance | 8.924586425 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100000 | 30412 | 0.9% |
| 150000 | 28912 | 0.9% |
| 120000 | 28128 | 0.8% |
| 1 | 25529 | 0.8% |
| 80000 | 24691 | 0.7% |
| 50000 | 24561 | 0.7% |
| 200000 | 23924 | 0.7% |
| 130000 | 23890 | 0.7% |
| 90000 | 23612 | 0.7% |
| 60000 | 23470 | 0.7% |
| Other values (130998) | 3080345 | |
| (Missing) | 45338 | 1.3% |
| Value | Count | Frequency (%) |
| 0.01 | 1 | < 0.1% |
| 0.02 | 3 | < 0.1% |
| 0.04 | 1 | < 0.1% |
| 0.1 | 4 | < 0.1% |
| 0.11 | 2 | < 0.1% |
| 0.12 | 2 | < 0.1% |
| 0.14 | 2 | < 0.1% |
| 0.15 | 580 | |
| 0.16 | 12 | < 0.1% |
| 0.18 | 147 | < 0.1% |
| Value | Count | Frequency (%) |
| 686496000 | 1 | < 0.1% |
| 489708000 | 1 | < 0.1% |
| 476780000 | 27 | |
| 445000000 | 6 | < 0.1% |
| 402000000 | 3 | < 0.1% |
| 335310656 | 17 | < 0.1% |
| 295500000 | 1 | < 0.1% |
| 262243024 | 14 | < 0.1% |
| 218777264 | 2 | < 0.1% |
| 216540000 | 59 |
| Distinct | 7162 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 1395778 |
| Missing (%) | 41.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 792.5980295 |
| Minimum | 1 |
|---|---|
| Maximum | 9999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 26 |
| Q3 | 100 |
| 95-th percentile | 5753 |
| Maximum | 9999 |
| Range | 9998 |
| Interquartile range (IQR) | 92 |
Descriptive statistics
| Standard deviation | 2142.359475 |
|---|---|
| Coefficient of variation (CV) | 2.702958366 |
| Kurtosis | 7.083597824 |
| Mean | 792.5980295 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | 2.872950803 |
| Sum | 1574919233 |
| Variance | 4589704.12 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 94674 | 2.8% |
| 2 | 76718 | 2.3% |
| 3 | 62536 | 1.8% |
| 4 | 60341 | 1.8% |
| 6 | 55192 | 1.6% |
| 5 | 54850 | 1.6% |
| 7 | 49049 | 1.4% |
| 8 | 47924 | 1.4% |
| 10 | 44852 | 1.3% |
| 9 | 41920 | 1.2% |
| Other values (7152) | 1398978 | |
| (Missing) | 1395778 |
| Value | Count | Frequency (%) |
| 1 | 94674 | |
| 2 | 76718 | |
| 3 | 62536 | |
| 4 | 60341 | |
| 5 | 54850 | |
| 6 | 55192 | |
| 7 | 49049 | |
| 8 | 47924 | |
| 9 | 41920 | |
| 10 | 44852 |
| Value | Count | Frequency (%) |
| 9999 | 402 | |
| 9998 | 44 | < 0.1% |
| 9997 | 15 | < 0.1% |
| 9996 | 20 | < 0.1% |
| 9995 | 10 | < 0.1% |
| 9994 | 25 | < 0.1% |
| 9993 | 3 | < 0.1% |
| 9992 | 4 | < 0.1% |
| 9991 | 15 | < 0.1% |
| 9990 | 16 | < 0.1% |
| Distinct | 41 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3237253 |
| Missing (%) | 95.7% |
| Memory size | 25.8 MiB |
| B | |
|---|---|
| A | |
| F | |
| T | |
| C | 4758 |
| Other values (36) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | B |
| 3rd row | B |
| 4th row | B |
| 5th row | B |
Common Values
| Value | Count | Frequency (%) |
| B | 83429 | 2.5% |
| A | 22891 | 0.7% |
| F | 12996 | 0.4% |
| T | 11192 | 0.3% |
| C | 4758 | 0.1% |
| D | 2193 | 0.1% |
| E | 1253 | < 0.1% |
| Q | 1250 | < 0.1% |
| U | 1167 | < 0.1% |
| P | 991 | < 0.1% |
| Other values (31) | 3439 | 0.1% |
| (Missing) | 3237253 |
Length
| Value | Count | Frequency (%) |
| b | 83429 | |
| a | 22891 | 15.7% |
| f | 12996 | 8.9% |
| t | 11192 | 7.7% |
| c | 4758 | 3.3% |
| d | 2193 | 1.5% |
| e | 1253 | 0.9% |
| q | 1250 | 0.9% |
| u | 1167 | 0.8% |
| p | 991 | 0.7% |
| Other values (27) | 3439 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 478212 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 30162 |
| Missing (%) | 0.9% |
| Memory size | 25.8 MiB |
| LE VILLAGE | 32081 |
|---|---|
| LE BOURG | 26888 |
| RUE DE LA REPUBLIQUE | 6740 |
| RUE JEAN JAURES | 6026 |
| GR GRANDE RUE | 5854 |
| Other values (478207) |
Length
| Max length | 31 |
|---|---|
| Median length | 14 |
| Mean length | 14.69918244 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 162426 ? |
|---|---|
| Unique (%) | 4.8% |
Sample
| 1st row | RUE CHARLES ROBIN |
|---|---|
| 2nd row | LES VAVRES |
| 3rd row | LA POIPE |
| 4th row | LA POIPE |
| 5th row | LA POIPE |
Common Values
| Value | Count | Frequency (%) |
| LE VILLAGE | 32081 | 0.9% |
| LE BOURG | 26888 | 0.8% |
| RUE DE LA REPUBLIQUE | 6740 | 0.2% |
| RUE JEAN JAURES | 6026 | 0.2% |
| GR GRANDE RUE | 5854 | 0.2% |
| RUE PASTEUR | 5706 | 0.2% |
| RUE VICTOR HUGO | 5587 | 0.2% |
| AV JEAN JAURES | 5473 | 0.2% |
| RUE DE PARIS | 5357 | 0.2% |
| AV DE LA REPUBLIQUE | 4692 | 0.1% |
| Other values (478202) | 3248246 | |
| (Missing) | 30162 | 0.9% |
Length
| Value | Count | Frequency (%) |
| rue | 1118363 | 11.6% |
| de | 735850 | 7.6% |
| la | 495547 | 5.1% |
| du | 334491 | 3.5% |
| le | 286518 | 3.0% |
| des | 283200 | 2.9% |
| av | 270931 | 2.8% |
| les | 231098 | 2.4% |
| che | 107233 | 1.1% |
| rte | 101042 | 1.0% |
| Other values (200164) | 5665356 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 15865 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 30113 |
| Missing (%) | 0.9% |
| Memory size | 25.8 MiB |
| 0020 | 17335 |
|---|---|
| B005 | 15623 |
| B004 | 15312 |
| B008 | 15243 |
| B002 | 15034 |
| Other values (15860) |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1276 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0820 |
|---|---|
| 2nd row | B032 |
| 3rd row | B080 |
| 4th row | B080 |
| 5th row | B080 |
Common Values
| Value | Count | Frequency (%) |
| 0020 | 17335 | 0.5% |
| B005 | 15623 | 0.5% |
| B004 | 15312 | 0.5% |
| B008 | 15243 | 0.5% |
| B002 | 15034 | 0.4% |
| B003 | 14943 | 0.4% |
| B010 | 14793 | 0.4% |
| B007 | 14740 | 0.4% |
| B013 | 14738 | 0.4% |
| B006 | 14617 | 0.4% |
| Other values (15855) | 3200321 | |
| (Missing) | 30113 | 0.9% |
Length
| Value | Count | Frequency (%) |
| 0020 | 17335 | 0.5% |
| b005 | 15623 | 0.5% |
| b004 | 15312 | 0.5% |
| b008 | 15243 | 0.5% |
| b002 | 15034 | 0.4% |
| b003 | 14943 | 0.4% |
| b010 | 14793 | 0.4% |
| b007 | 14740 | 0.4% |
| b013 | 14738 | 0.4% |
| b006 | 14617 | 0.4% |
| Other values (15855) | 3200321 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 5868 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 30502 |
| Missing (%) | 0.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50696.88167 |
| Minimum | 1000 |
|---|---|
| Maximum | 97490 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 6700 |
| Q1 | 29630 |
| median | 49400 |
| Q3 | 75015 |
| 95-th percentile | 93140 |
| Maximum | 97490 |
| Range | 96490 |
| Interquartile range (IQR) | 45385 |
Descriptive statistics
| Standard deviation | 27399.42192 |
|---|---|
| Coefficient of variation (CV) | 0.5404557642 |
| Kurtosis | -1.196477499 |
| Mean | 50696.88167 |
| Median Absolute Deviation (MAD) | 23750 |
| Skewness | -0.009140286181 |
| Sum | 1.699516634 × 1011 |
| Variance | 750728321.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 31200 | 8834 | 0.3% |
| 35000 | 8373 | 0.2% |
| 69100 | 7990 | 0.2% |
| 21000 | 7768 | 0.2% |
| 59300 | 7609 | 0.2% |
| 54000 | 7088 | 0.2% |
| 75015 | 6865 | 0.2% |
| 75016 | 6681 | 0.2% |
| 51100 | 6571 | 0.2% |
| 29200 | 6318 | 0.2% |
| Other values (5858) | 3278213 | |
| (Missing) | 30502 | 0.9% |
| Value | Count | Frequency (%) |
| 1000 | 1853 | |
| 1090 | 399 | < 0.1% |
| 1100 | 1047 | |
| 1110 | 599 | < 0.1% |
| 1120 | 763 | |
| 1130 | 467 | < 0.1% |
| 1140 | 460 | < 0.1% |
| 1150 | 1115 | |
| 1160 | 655 | < 0.1% |
| 1170 | 1435 |
| Value | Count | Frequency (%) |
| 97490 | 1785 | |
| 97480 | 521 | < 0.1% |
| 97470 | 241 | < 0.1% |
| 97460 | 739 | |
| 97450 | 185 | < 0.1% |
| 97442 | 47 | < 0.1% |
| 97441 | 217 | < 0.1% |
| 97440 | 717 | |
| 97439 | 49 | < 0.1% |
| 97438 | 563 | < 0.1% |
| Distinct | 30549 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.8 MiB |
| Toulouse | 31001 |
|---|---|
| Nantes | 19016 |
| Bordeaux | 17624 |
| Nice | 17192 |
| Montpellier | 16912 |
| Other values (30544) |
Length
| Max length | 45 |
|---|---|
| Median length | 10 |
| Mean length | 11.84390885 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 356 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Bourg-en-Bresse |
|---|---|
| 2nd row | Péronnas |
| 3rd row | Saint-Cyr-sur-Menthon |
| 4th row | Saint-Cyr-sur-Menthon |
| 5th row | Saint-Cyr-sur-Menthon |
Common Values
| Value | Count | Frequency (%) |
| Toulouse | 31001 | 0.9% |
| Nantes | 19016 | 0.6% |
| Bordeaux | 17624 | 0.5% |
| Nice | 17192 | 0.5% |
| Montpellier | 16912 | 0.5% |
| Rennes | 13582 | 0.4% |
| Lille | 12653 | 0.4% |
| Villeurbanne | 8158 | 0.2% |
| Dijon | 7890 | 0.2% |
| Aix-en-Provence | 7826 | 0.2% |
| Other values (30539) | 3230958 |
Length
| Value | Count | Frequency (%) |
| arrondissement | 124918 | 3.2% |
| la | 102321 | 2.6% |
| le | 95086 | 2.4% |
| paris | 64921 | 1.7% |
| marseille | 35619 | 0.9% |
| les | 35379 | 0.9% |
| toulouse | 31001 | 0.8% |
| lyon | 24378 | 0.6% |
| nantes | 19016 | 0.5% |
| bordeaux | 17624 | 0.4% |
| Other values (30459) | 3375620 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
ancien_code_commune
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 787 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 3323835 |
| Missing (%) | 98.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 56822.89601 |
| Minimum | 1025 |
|---|---|
| Maximum | 95308 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 1025 |
|---|---|
| 5-th percentile | 9334 |
| Q1 | 35100 |
| median | 56165 |
| Q3 | 80485 |
| 95-th percentile | 93070 |
| Maximum | 95308 |
| Range | 94283 |
| Interquartile range (IQR) | 45385 |
Descriptive statistics
| Standard deviation | 27504.53236 |
|---|---|
| Coefficient of variation (CV) | 0.4840396088 |
| Kurtosis | -1.14627258 |
| Mean | 56822.89601 |
| Median Absolute Deviation (MAD) | 23164 |
| Skewness | -0.3339921404 |
| Sum | 3351243938 |
| Variance | 756499300.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 93070 | 2888 | 0.1% |
| 85194 | 1919 | 0.1% |
| 91228 | 1746 | 0.1% |
| 85166 | 1479 | < 0.1% |
| 78551 | 1427 | < 0.1% |
| 95306 | 1021 | < 0.1% |
| 73257 | 1016 | < 0.1% |
| 85060 | 917 | < 0.1% |
| 22050 | 915 | < 0.1% |
| 78158 | 695 | < 0.1% |
| Other values (777) | 44954 | 1.3% |
| (Missing) | 3323835 |
| Value | Count | Frequency (%) |
| 1025 | 200 | |
| 1033 | 413 | |
| 1036 | 96 | < 0.1% |
| 1059 | 16 | < 0.1% |
| 1091 | 255 | |
| 1097 | 45 | < 0.1% |
| 1122 | 51 | < 0.1% |
| 1130 | 63 | < 0.1% |
| 1144 | 53 | < 0.1% |
| 1154 | 37 | < 0.1% |
| Value | Count | Frequency (%) |
| 95308 | 27 | < 0.1% |
| 95306 | 1021 | < 0.1% |
| 95040 | 31 | < 0.1% |
| 93070 | 2888 | |
| 91390 | 213 | < 0.1% |
| 91228 | 1746 | |
| 91222 | 12 | < 0.1% |
| 91182 | 506 | < 0.1% |
| 90073 | 9 | < 0.1% |
| 90068 | 39 | < 0.1% |
| Distinct | 782 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 3323835 |
| Missing (%) | 98.3% |
| Memory size | 25.8 MiB |
| Saint-Ouen | 2888 |
|---|---|
| Les Sables-d'Olonne | 1919 |
| Évry | 1746 |
| Olonne-sur-Mer | 1479 |
| Saint-Germain-en-Laye | 1427 |
| Other values (777) |
Length
| Max length | 30 |
|---|---|
| Median length | 10 |
| Mean length | 12.52469607 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Cras-sur-Reyssouze |
|---|---|
| 2nd row | Étrez |
| 3rd row | Bâgé-la-Ville |
| 4th row | Cras-sur-Reyssouze |
| 5th row | Bâgé-la-Ville |
Common Values
| Value | Count | Frequency (%) |
| Saint-Ouen | 2888 | 0.1% |
| Les Sables-d'Olonne | 1919 | 0.1% |
| Évry | 1746 | 0.1% |
| Olonne-sur-Mer | 1479 | < 0.1% |
| Saint-Germain-en-Laye | 1427 | < 0.1% |
| Herblay | 1021 | < 0.1% |
| Les Belleville | 1016 | < 0.1% |
| Château-d'Olonne | 917 | < 0.1% |
| Dinan | 915 | < 0.1% |
| Le Chesnay | 695 | < 0.1% |
| Other values (772) | 44954 | 1.3% |
| (Missing) | 3323835 |
Length
| Value | Count | Frequency (%) |
| les | 4805 | 6.8% |
| saint-ouen | 2888 | 4.1% |
| sables-d'olonne | 1919 | 2.7% |
| évry | 1746 | 2.5% |
| la | 1699 | 2.4% |
| belleville | 1494 | 2.1% |
| olonne-sur-mer | 1479 | 2.1% |
| saint-germain-en-laye | 1427 | 2.0% |
| le | 1280 | 1.8% |
| herblay | 1021 | 1.5% |
| Other values (791) | 50447 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 2029416 |
|---|---|
| Distinct (%) | 60.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.8 MiB |
| 91286000AR0114 | 3058 |
|---|---|
| 33193000AD0001 | 2812 |
| 29039000BL0050 | 1340 |
| 33193000AC0001 | 1287 |
| 33193000AA0002 | 1136 |
| Other values (2029411) |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1619045 ? |
|---|---|
| Unique (%) | 47.9% |
Sample
| 1st row | 01053000BK0039 |
|---|---|
| 2nd row | 01289000AR0388 |
| 3rd row | 01343000ZM0197 |
| 4th row | 01343000ZM0198 |
| 5th row | 01343000ZM0201 |
Common Values
| Value | Count | Frequency (%) |
| 91286000AR0114 | 3058 | 0.1% |
| 33193000AD0001 | 2812 | 0.1% |
| 29039000BL0050 | 1340 | < 0.1% |
| 33193000AC0001 | 1287 | < 0.1% |
| 33193000AA0002 | 1136 | < 0.1% |
| 30189000EM0022 | 828 | < 0.1% |
| 13056000AP0103 | 816 | < 0.1% |
| 593830000B6722 | 740 | < 0.1% |
| 93007000AO0314 | 612 | < 0.1% |
| 47001000AI0309 | 590 | < 0.1% |
| Other values (2029406) | 3369593 |
Length
| Value | Count | Frequency (%) |
| 91286000ar0114 | 3058 | 0.1% |
| 33193000ad0001 | 2812 | 0.1% |
| 29039000bl0050 | 1340 | < 0.1% |
| 33193000ac0001 | 1287 | < 0.1% |
| 33193000aa0002 | 1136 | < 0.1% |
| 30189000em0022 | 828 | < 0.1% |
| 13056000ap0103 | 816 | < 0.1% |
| 593830000b6722 | 740 | < 0.1% |
| 93007000ao0314 | 612 | < 0.1% |
| 47001000ai0309 | 590 | < 0.1% |
| Other values (2029406) | 3369593 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 16409 |
|---|---|
| Distinct (%) | 78.1% |
| Missing | 3361806 |
| Missing (%) | 99.4% |
| Memory size | 25.8 MiB |
| 85166000AC1258 | 152 |
|---|---|
| 91182000AB0141 | 112 |
| 85027000ZC0289 | 97 |
| 91182000AN0528 | 85 |
| 85166000AT0535 | 67 |
| Other values (16404) |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 14024 ? |
|---|---|
| Unique (%) | 66.8% |
Sample
| 1st row | 01154000ZC0118 |
|---|---|
| 2nd row | 01154000ZE0078 |
| 3rd row | 01154000ZH0020 |
| 4th row | 01154000ZH0020 |
| 5th row | 01154000ZH0022 |
Common Values
| Value | Count | Frequency (%) |
| 85166000AC1258 | 152 | < 0.1% |
| 91182000AB0141 | 112 | < 0.1% |
| 85027000ZC0289 | 97 | < 0.1% |
| 91182000AN0528 | 85 | < 0.1% |
| 85166000AT0535 | 67 | < 0.1% |
| 85166000AC1259 | 62 | < 0.1% |
| 85166000AL0321 | 45 | < 0.1% |
| 782510000B0256 | 44 | < 0.1% |
| 85166000AT0557 | 41 | < 0.1% |
| 85166000AX0091 | 33 | < 0.1% |
| Other values (16399) | 20268 | 0.6% |
| (Missing) | 3361806 |
Length
| Value | Count | Frequency (%) |
| 85166000ac1258 | 152 | 0.7% |
| 91182000ab0141 | 112 | 0.5% |
| 85027000zc0289 | 97 | 0.5% |
| 91182000an0528 | 85 | 0.4% |
| 85166000at0535 | 67 | 0.3% |
| 85166000ac1259 | 62 | 0.3% |
| 85166000al0321 | 45 | 0.2% |
| 782510000b0256 | 44 | 0.2% |
| 85166000at0557 | 41 | 0.2% |
| 85166000ax0091 | 33 | 0.2% |
| Other values (16399) | 20268 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
lot1_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWED| Distinct | 18059 |
|---|---|
| Distinct (%) | 6.5% |
| Missing | 3104314 |
| Missing (%) | 91.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68.57213215 |
| Minimum | 0.36 |
|---|---|
| Maximum | 9999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 0.36 |
|---|---|
| 5-th percentile | 16.7 |
| Q1 | 34.0425 |
| median | 53.6 |
| Q3 | 73.68 |
| 95-th percentile | 118.3815 |
| Maximum | 9999 |
| Range | 9998.64 |
| Interquartile range (IQR) | 39.6375 |
Descriptive statistics
| Standard deviation | 217.985836 |
|---|---|
| Coefficient of variation (CV) | 3.178927491 |
| Kurtosis | 851.4324699 |
| Mean | 68.57213215 |
| Median Absolute Deviation (MAD) | 19.77 |
| Skewness | 27.3471126 |
| Sum | 19097201.66 |
| Variance | 47517.82471 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12.5 | 628 | < 0.1% |
| 12 | 417 | < 0.1% |
| 15 | 413 | < 0.1% |
| 47 | 385 | < 0.1% |
| 10 | 365 | < 0.1% |
| 30 | 357 | < 0.1% |
| 40 | 355 | < 0.1% |
| 60 | 336 | < 0.1% |
| 20 | 319 | < 0.1% |
| 42 | 316 | < 0.1% |
| Other values (18049) | 274607 | 8.1% |
| (Missing) | 3104314 |
| Value | Count | Frequency (%) |
| 0.36 | 5 | < 0.1% |
| 0.52 | 1 | < 0.1% |
| 0.57 | 1 | < 0.1% |
| 0.6 | 1 | < 0.1% |
| 0.85 | 1 | < 0.1% |
| 0.9 | 2 | < 0.1% |
| 0.97 | 1 | < 0.1% |
| 0.98 | 1 | < 0.1% |
| 0.99 | 1 | < 0.1% |
| 1 | 61 |
| Value | Count | Frequency (%) |
| 9999 | 13 | |
| 9118 | 1 | < 0.1% |
| 8409 | 1 | < 0.1% |
| 7992.2 | 1 | < 0.1% |
| 7912 | 1 | < 0.1% |
| 7264 | 1 | < 0.1% |
| 7101 | 1 | < 0.1% |
| 6939 | 1 | < 0.1% |
| 6933 | 1 | < 0.1% |
| 6887 | 1 | < 0.1% |
lot2_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWED| Distinct | 12325 |
|---|---|
| Distinct (%) | 18.0% |
| Missing | 3314519 |
| Missing (%) | 98.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66.08874145 |
| Minimum | 0.36 |
|---|---|
| Maximum | 8928 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 0.36 |
|---|---|
| 5-th percentile | 23.126 |
| Q1 | 43.27 |
| median | 61.11 |
| Q3 | 76.45 |
| 95-th percentile | 111.32 |
| Maximum | 8928 |
| Range | 8927.64 |
| Interquartile range (IQR) | 33.18 |
Descriptive statistics
| Standard deviation | 130.3559422 |
|---|---|
| Coefficient of variation (CV) | 1.972437958 |
| Kurtosis | 2308.390612 |
| Mean | 66.08874145 |
| Median Absolute Deviation (MAD) | 16.7 |
| Skewness | 45.2560629 |
| Sum | 4513398.42 |
| Variance | 16992.67168 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 68 | 80 | < 0.1% |
| 70 | 73 | < 0.1% |
| 60 | 69 | < 0.1% |
| 55 | 69 | < 0.1% |
| 63 | 69 | < 0.1% |
| 65 | 68 | < 0.1% |
| 67 | 65 | < 0.1% |
| 40 | 63 | < 0.1% |
| 75 | 60 | < 0.1% |
| 64 | 58 | < 0.1% |
| Other values (12315) | 67619 | 2.0% |
| (Missing) | 3314519 |
| Value | Count | Frequency (%) |
| 0.36 | 3 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 0.71 | 1 | < 0.1% |
| 0.97 | 1 | < 0.1% |
| 1 | 13 | |
| 1.05 | 1 | < 0.1% |
| 1.14 | 1 | < 0.1% |
| 1.2 | 1 | < 0.1% |
| 1.22 | 1 | < 0.1% |
| 1.24 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 8928 | 1 | < 0.1% |
| 8525 | 1 | < 0.1% |
| 7996 | 1 | < 0.1% |
| 7949 | 1 | < 0.1% |
| 7215 | 1 | < 0.1% |
| 7108 | 1 | < 0.1% |
| 6478 | 1 | < 0.1% |
| 5680.8 | 19 | |
| 4665 | 1 | < 0.1% |
| 4375 | 1 | < 0.1% |
lot3_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 4689 |
|---|---|
| Distinct (%) | 70.5% |
| Missing | 3376163 |
| Missing (%) | 99.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 93.27204392 |
| Minimum | 0.36 |
|---|---|
| Maximum | 7529.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 0.36 |
|---|---|
| 5-th percentile | 12.5 |
| Q1 | 37.67 |
| median | 61.43 |
| Q3 | 86.36 |
| 95-th percentile | 164.326 |
| Maximum | 7529.1 |
| Range | 7528.74 |
| Interquartile range (IQR) | 48.69 |
Descriptive statistics
| Standard deviation | 394.7234758 |
|---|---|
| Coefficient of variation (CV) | 4.231959108 |
| Kurtosis | 338.4473672 |
| Mean | 93.27204392 |
| Median Absolute Deviation (MAD) | 24.12 |
| Skewness | 18.15884772 |
| Sum | 620165.82 |
| Variance | 155806.6223 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12.5 | 27 | < 0.1% |
| 401 | 25 | < 0.1% |
| 7.98 | 20 | < 0.1% |
| 7529.1 | 18 | < 0.1% |
| 70 | 13 | < 0.1% |
| 67 | 13 | < 0.1% |
| 10 | 13 | < 0.1% |
| 12 | 13 | < 0.1% |
| 15 | 12 | < 0.1% |
| 43 | 12 | < 0.1% |
| Other values (4679) | 6483 | 0.2% |
| (Missing) | 3376163 |
| Value | Count | Frequency (%) |
| 0.36 | 1 | < 0.1% |
| 0.38 | 1 | < 0.1% |
| 0.4 | 1 | < 0.1% |
| 0.52 | 1 | < 0.1% |
| 0.68 | 1 | < 0.1% |
| 1 | 2 | |
| 1.03 | 1 | < 0.1% |
| 1.21 | 1 | < 0.1% |
| 1.23 | 1 | < 0.1% |
| 1.5 | 3 |
| Value | Count | Frequency (%) |
| 7529.1 | 18 | |
| 1560.1 | 3 | < 0.1% |
| 1481.52 | 2 | < 0.1% |
| 1090.3 | 1 | < 0.1% |
| 1089.9 | 1 | < 0.1% |
| 1036.3 | 1 | < 0.1% |
| 838 | 1 | < 0.1% |
| 780 | 1 | < 0.1% |
| 756.7 | 1 | < 0.1% |
| 658.07 | 2 | < 0.1% |
lot4_numero
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWED| Distinct | 814 |
|---|---|
| Distinct (%) | 6.5% |
| Missing | 3370210 |
| Missing (%) | 99.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 167.6030789 |
| Minimum | 2 |
|---|---|
| Maximum | 191612 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 8 |
| median | 24 |
| Q3 | 70.75 |
| 95-th percentile | 394.85 |
| Maximum | 191612 |
| Range | 191610 |
| Interquartile range (IQR) | 62.75 |
Descriptive statistics
| Standard deviation | 2699.381251 |
|---|---|
| Coefficient of variation (CV) | 16.10579751 |
| Kurtosis | 3621.121997 |
| Mean | 167.6030789 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | 57.55405177 |
| Sum | 2112134 |
| Variance | 7286659.136 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 702 | < 0.1% |
| 9 | 695 | < 0.1% |
| 8 | 691 | < 0.1% |
| 6 | 685 | < 0.1% |
| 4 | 616 | < 0.1% |
| 5 | 583 | < 0.1% |
| 3 | 301 | < 0.1% |
| 2 | 219 | < 0.1% |
| 13 | 199 | < 0.1% |
| 18 | 153 | < 0.1% |
| Other values (804) | 7758 | 0.2% |
| (Missing) | 3370210 |
| Value | Count | Frequency (%) |
| 2 | 219 | < 0.1% |
| 3 | 301 | |
| 4 | 616 | |
| 5 | 583 | |
| 6 | 685 | |
| 7 | 702 | |
| 8 | 691 | |
| 9 | 695 | |
| 11 | 6 | < 0.1% |
| 12 | 119 | < 0.1% |
| Value | Count | Frequency (%) |
| 191612 | 1 | |
| 161311 | 1 | |
| 141214 | 1 | |
| 53006 | 1 | |
| 30066 | 1 | |
| 25032 | 1 | |
| 25004 | 1 | |
| 23004 | 1 | |
| 20097 | 1 | |
| 20080 | 1 |
lot4_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 1444 |
|---|---|
| Distinct (%) | 82.4% |
| Missing | 3381060 |
| Missing (%) | 99.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 114.5579281 |
| Minimum | 0.54 |
|---|---|
| Maximum | 2321 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 0.54 |
|---|---|
| 5-th percentile | 8.655 |
| Q1 | 30.9325 |
| median | 61.5 |
| Q3 | 100 |
| 95-th percentile | 291.415 |
| Maximum | 2321 |
| Range | 2320.46 |
| Interquartile range (IQR) | 69.0675 |
Descriptive statistics
| Standard deviation | 259.3426044 |
|---|---|
| Coefficient of variation (CV) | 2.263855577 |
| Kurtosis | 39.16993756 |
| Mean | 114.5579281 |
| Median Absolute Deviation (MAD) | 33.005 |
| Skewness | 6.101077765 |
| Sum | 200705.49 |
| Variance | 67258.58644 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1945 | 25 | < 0.1% |
| 5.8 | 19 | < 0.1% |
| 25.3 | 9 | < 0.1% |
| 12.5 | 8 | < 0.1% |
| 12 | 8 | < 0.1% |
| 44.99 | 8 | < 0.1% |
| 8 | 6 | < 0.1% |
| 20 | 6 | < 0.1% |
| 15 | 6 | < 0.1% |
| 14.84 | 5 | < 0.1% |
| Other values (1434) | 1652 | < 0.1% |
| (Missing) | 3381060 |
| Value | Count | Frequency (%) |
| 0.54 | 1 | |
| 0.55 | 1 | |
| 0.87 | 1 | |
| 0.98 | 1 | |
| 1 | 2 | |
| 1.17 | 1 | |
| 1.31 | 1 | |
| 1.4 | 1 | |
| 1.58 | 1 | |
| 1.71 | 1 |
| Value | Count | Frequency (%) |
| 2321 | 1 | < 0.1% |
| 1945 | 25 | |
| 1612 | 1 | < 0.1% |
| 1560.1 | 3 | < 0.1% |
| 1481.52 | 1 | < 0.1% |
| 1333.7 | 1 | < 0.1% |
| 1243.52 | 2 | < 0.1% |
| 891.9 | 1 | < 0.1% |
| 743.28 | 1 | < 0.1% |
| 735.85 | 1 | < 0.1% |
lot5_numero
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWED| Distinct | 584 |
|---|---|
| Distinct (%) | 9.7% |
| Missing | 3376821 |
| Missing (%) | 99.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 207.2927725 |
| Minimum | 2 |
|---|---|
| Maximum | 191613 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 8 |
| median | 26 |
| Q3 | 76 |
| 95-th percentile | 437.5 |
| Maximum | 191613 |
| Range | 191611 |
| Interquartile range (IQR) | 68 |
Descriptive statistics
| Standard deviation | 3399.215504 |
|---|---|
| Coefficient of variation (CV) | 16.39813807 |
| Kurtosis | 2531.539074 |
| Mean | 207.2927725 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 48.48277988 |
| Sum | 1241891 |
| Variance | 11554666.05 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 370 | < 0.1% |
| 9 | 359 | < 0.1% |
| 7 | 325 | < 0.1% |
| 5 | 324 | < 0.1% |
| 6 | 289 | < 0.1% |
| 4 | 168 | < 0.1% |
| 14 | 122 | < 0.1% |
| 3 | 118 | < 0.1% |
| 2 | 90 | < 0.1% |
| 15 | 82 | < 0.1% |
| Other values (574) | 3744 | 0.1% |
| (Missing) | 3376821 |
| Value | Count | Frequency (%) |
| 2 | 90 | < 0.1% |
| 3 | 118 | < 0.1% |
| 4 | 168 | |
| 5 | 324 | |
| 6 | 289 | |
| 7 | 325 | |
| 8 | 370 | |
| 9 | 359 | |
| 11 | 2 | < 0.1% |
| 12 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 191613 | 1 | |
| 161312 | 1 | |
| 53007 | 1 | |
| 25033 | 1 | |
| 25005 | 1 | |
| 20098 | 1 | |
| 20081 | 1 | |
| 20022 | 1 | |
| 12004 | 1 | |
| 10027 | 1 |
lot5_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 630 |
|---|---|
| Distinct (%) | 83.8% |
| Missing | 3382060 |
| Missing (%) | > 99.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 95.3599867 |
| Minimum | 0.6 |
|---|---|
| Maximum | 1560.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 0.6 |
|---|---|
| 5-th percentile | 6.944 |
| Q1 | 24.7975 |
| median | 62.355 |
| Q3 | 107.38 |
| 95-th percentile | 288.11 |
| Maximum | 1560.1 |
| Range | 1559.5 |
| Interquartile range (IQR) | 82.5825 |
Descriptive statistics
| Standard deviation | 144.6331889 |
|---|---|
| Coefficient of variation (CV) | 1.516707309 |
| Kurtosis | 47.52928584 |
| Mean | 95.3599867 |
| Median Absolute Deviation (MAD) | 41.24 |
| Skewness | 5.824527235 |
| Sum | 71710.71 |
| Variance | 20918.75932 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.66 | 19 | < 0.1% |
| 26.9 | 8 | < 0.1% |
| 16.97 | 6 | < 0.1% |
| 21.44 | 5 | < 0.1% |
| 12.5 | 5 | < 0.1% |
| 38.78 | 5 | < 0.1% |
| 10 | 4 | < 0.1% |
| 45.8 | 4 | < 0.1% |
| 1560.1 | 3 | < 0.1% |
| 13 | 3 | < 0.1% |
| Other values (620) | 690 | < 0.1% |
| (Missing) | 3382060 |
| Value | Count | Frequency (%) |
| 0.6 | 1 | |
| 0.86 | 1 | |
| 1.2 | 2 | |
| 1.4 | 1 | |
| 1.67 | 1 | |
| 1.71 | 1 | |
| 2.93 | 1 | |
| 3.3 | 1 | |
| 3.79 | 1 | |
| 4.08 | 1 |
| Value | Count | Frequency (%) |
| 1560.1 | 3 | |
| 1047.7 | 1 | < 0.1% |
| 927.2 | 1 | < 0.1% |
| 884.88 | 1 | < 0.1% |
| 805 | 1 | < 0.1% |
| 728.7 | 1 | < 0.1% |
| 658.07 | 1 | < 0.1% |
| 651.5 | 1 | < 0.1% |
| 643.33 | 1 | < 0.1% |
| 602.74 | 1 | < 0.1% |
| Distinct | 82 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.403809316 |
| Minimum | 0 |
|---|---|
| Maximum | 255 |
| Zeros | 2313128 |
| Zeros (%) | 68.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 255 |
| Range | 255 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8186193593 |
|---|---|
| Coefficient of variation (CV) | 2.027242381 |
| Kurtosis | 5522.619568 |
| Mean | 0.403809316 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 33.01288703 |
| Sum | 1366011 |
| Variance | 0.6701376555 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2313128 | |
| 1 | 848066 | 25.1% |
| 2 | 184943 | 5.5% |
| 3 | 24073 | 0.7% |
| 4 | 6611 | 0.2% |
| 5 | 2389 | 0.1% |
| 6 | 1254 | < 0.1% |
| 7 | 631 | < 0.1% |
| 8 | 468 | < 0.1% |
| 9 | 278 | < 0.1% |
| Other values (72) | 971 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2313128 | |
| 1 | 848066 | 25.1% |
| 2 | 184943 | 5.5% |
| 3 | 24073 | 0.7% |
| 4 | 6611 | 0.2% |
| 5 | 2389 | 0.1% |
| 6 | 1254 | < 0.1% |
| 7 | 631 | < 0.1% |
| 8 | 468 | < 0.1% |
| 9 | 278 | < 0.1% |
| Value | Count | Frequency (%) |
| 255 | 1 | |
| 184 | 1 | |
| 137 | 1 | |
| 126 | 1 | |
| 118 | 1 | |
| 117 | 1 | |
| 116 | 1 | |
| 112 | 1 | |
| 108 | 1 | |
| 105 | 1 |
code_type_local
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1508037 |
| Missing (%) | 44.6% |
| Memory size | 25.8 MiB |
| 1.0 | |
|---|---|
| 2.0 | |
| 3.0 | |
| 4.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 2.0 |
| 4th row | 2.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 660584 | |
| 2.0 | 611108 | |
| 3.0 | 475844 | 14.1% |
| 4.0 | 127239 | 3.8% |
| (Missing) | 1508037 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1.0 | 660584 | |
| 2.0 | 611108 | |
| 3.0 | 475844 | |
| 4.0 | 127239 | 6.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1508037 |
| Missing (%) | 44.6% |
| Memory size | 25.8 MiB |
| Maison | |
|---|---|
| Appartement | |
| Dépendance | |
| Local industriel. commercial ou assimilé |
Length
| Max length | 40 |
|---|---|
| Median length | 10 |
| Mean length | 10.95261671 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Appartement |
|---|---|
| 2nd row | Dépendance |
| 3rd row | Appartement |
| 4th row | Appartement |
| 5th row | Appartement |
Common Values
| Value | Count | Frequency (%) |
| Maison | 660584 | |
| Appartement | 611108 | |
| Dépendance | 475844 | 14.1% |
| Local industriel. commercial ou assimilé | 127239 | 3.8% |
| (Missing) | 1508037 |
Length
Pie chart
| Value | Count | Frequency (%) |
| maison | 660584 | |
| appartement | 611108 | |
| dépendance | 475844 | |
| assimilé | 127239 | 5.3% |
| ou | 127239 | 5.3% |
| commercial | 127239 | 5.3% |
| industriel | 127239 | 5.3% |
| local | 127239 | 5.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 4596 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1992066 |
| Missing (%) | 58.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 117.7884474 |
| Minimum | 1 |
|---|---|
| Maximum | 646230 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 49 |
| median | 74 |
| Q3 | 103 |
| 95-th percentile | 186 |
| Maximum | 646230 |
| Range | 646229 |
| Interquartile range (IQR) | 54 |
Descriptive statistics
| Standard deviation | 1747.355457 |
|---|---|
| Coefficient of variation (CV) | 14.83469301 |
| Kurtosis | 109304.3218 |
| Mean | 117.7884474 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 305.1601927 |
| Sum | 163813812 |
| Variance | 3053251.092 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 26855 | 0.8% |
| 60 | 25802 | 0.8% |
| 70 | 24171 | 0.7% |
| 90 | 23160 | 0.7% |
| 50 | 20512 | 0.6% |
| 100 | 20205 | 0.6% |
| 65 | 20178 | 0.6% |
| 40 | 19196 | 0.6% |
| 45 | 17780 | 0.5% |
| 75 | 16553 | 0.5% |
| Other values (4586) | 1176334 | |
| (Missing) | 1992066 |
| Value | Count | Frequency (%) |
| 1 | 338 | < 0.1% |
| 2 | 344 | < 0.1% |
| 3 | 342 | < 0.1% |
| 4 | 183 | < 0.1% |
| 5 | 381 | < 0.1% |
| 6 | 390 | < 0.1% |
| 7 | 360 | < 0.1% |
| 8 | 717 | < 0.1% |
| 9 | 1066 | < 0.1% |
| 10 | 2907 |
| Value | Count | Frequency (%) |
| 646230 | 8 | |
| 317173 | 1 | < 0.1% |
| 272600 | 1 | < 0.1% |
| 212120 | 2 | < 0.1% |
| 166801 | 1 | < 0.1% |
| 137182 | 1 | < 0.1% |
| 132233 | 1 | < 0.1% |
| 121031 | 1 | < 0.1% |
| 120000 | 1 | < 0.1% |
| 114742 | 1 | < 0.1% |
nombre_pieces_principales
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 54 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1510700 |
| Missing (%) | 44.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.340234452 |
| Minimum | 0 |
|---|---|
| Maximum | 112 |
| Zeros | 603218 |
| Zeros (%) | 17.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 112 |
| Range | 112 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.075923995 |
|---|---|
| Coefficient of variation (CV) | 0.8870581295 |
| Kurtosis | 11.96713614 |
| Mean | 2.340234452 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.8435610494 |
| Sum | 4381181 |
| Variance | 4.309460435 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 603218 | 17.8% |
| 3 | 317013 | 9.4% |
| 4 | 308742 | 9.1% |
| 2 | 228614 | 6.8% |
| 5 | 177740 | 5.3% |
| 1 | 128653 | 3.8% |
| 6 | 67586 | 2.0% |
| 7 | 25042 | 0.7% |
| 8 | 9011 | 0.3% |
| 9 | 3331 | 0.1% |
| Other values (44) | 3162 | 0.1% |
| (Missing) | 1510700 |
| Value | Count | Frequency (%) |
| 0 | 603218 | |
| 1 | 128653 | 3.8% |
| 2 | 228614 | 6.8% |
| 3 | 317013 | |
| 4 | 308742 | |
| 5 | 177740 | 5.3% |
| 6 | 67586 | 2.0% |
| 7 | 25042 | 0.7% |
| 8 | 9011 | 0.3% |
| 9 | 3331 | 0.1% |
| Value | Count | Frequency (%) |
| 112 | 1 | |
| 93 | 1 | |
| 78 | 1 | |
| 70 | 1 | |
| 68 | 1 | |
| 66 | 1 | |
| 65 | 1 | |
| 60 | 1 | |
| 58 | 1 | |
| 55 | 1 |
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1079606 |
| Missing (%) | 31.9% |
| Memory size | 25.8 MiB |
| S | |
|---|---|
| T | |
| P | |
| AB | |
| J | |
| Other values (22) |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.209177121 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AB |
|---|---|
| 2nd row | P |
| 3rd row | P |
| 4th row | P |
| 5th row | P |
Common Values
| Value | Count | Frequency (%) |
| S | 1078512 | |
| T | 336587 | 9.9% |
| P | 176385 | 5.2% |
| AB | 155548 | 4.6% |
| J | 116100 | 3.4% |
| L | 91531 | 2.7% |
| BT | 91324 | 2.7% |
| AG | 80456 | 2.4% |
| VI | 40718 | 1.2% |
| BR | 34074 | 1.0% |
| Other values (17) | 101971 | 3.0% |
| (Missing) | 1079606 |
Length
| Value | Count | Frequency (%) |
| s | 1078512 | |
| t | 336587 | 14.6% |
| p | 176385 | 7.7% |
| ab | 155548 | 6.8% |
| j | 116100 | 5.0% |
| l | 91531 | 4.0% |
| bt | 91324 | 4.0% |
| ag | 80456 | 3.5% |
| vi | 40718 | 1.8% |
| br | 34074 | 1.5% |
| Other values (17) | 101971 | 4.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1079606 |
| Missing (%) | 31.9% |
| Memory size | 25.8 MiB |
| sols | |
|---|---|
| terres | |
| prés | |
| terrains a bâtir | |
| jardins | |
| Other values (22) |
Length
| Max length | 19 |
|---|---|
| Median length | 4 |
| Mean length | 6.774368424 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | terrains a bâtir |
|---|---|
| 2nd row | prés |
| 3rd row | prés |
| 4th row | prés |
| 5th row | prés |
Common Values
| Value | Count | Frequency (%) |
| sols | 1078512 | |
| terres | 336587 | 9.9% |
| prés | 176385 | 5.2% |
| terrains a bâtir | 155548 | 4.6% |
| jardins | 116100 | 3.4% |
| landes | 91531 | 2.7% |
| taillis simples | 91324 | 2.7% |
| terrains d'agrément | 80456 | 2.4% |
| vignes | 40718 | 1.2% |
| futaies résineuses | 34074 | 1.0% |
| Other values (17) | 101971 | 3.0% |
| (Missing) | 1079606 |
Length
| Value | Count | Frequency (%) |
| sols | 1078512 | |
| terres | 336682 | 11.8% |
| terrains | 236004 | 8.2% |
| prés | 178998 | 6.2% |
| a | 155548 | 5.4% |
| bâtir | 155548 | 5.4% |
| jardins | 116100 | 4.1% |
| taillis | 107842 | 3.8% |
| landes | 91834 | 3.2% |
| simples | 91324 | 3.2% |
| Other values (24) | 316392 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 125 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 3230429 |
| Missing (%) | 95.5% |
| Memory size | 25.8 MiB |
| POTAG | |
|---|---|
| PATUR | |
| PIN | |
| PARC | |
| FRICH | |
| Other values (120) |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.484279742 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | IMM |
|---|---|
| 2nd row | IMM |
| 3rd row | IMM |
| 4th row | IMM |
| 5th row | IMM |
Common Values
| Value | Count | Frequency (%) |
| POTAG | 32312 | 1.0% |
| PATUR | 15859 | 0.5% |
| PIN | 12896 | 0.4% |
| PARC | 12788 | 0.4% |
| FRICH | 10053 | 0.3% |
| VAOC | 7562 | 0.2% |
| CHAT | 4801 | 0.1% |
| IMM | 3866 | 0.1% |
| MARAI | 3475 | 0.1% |
| CHENE | 3230 | 0.1% |
| Other values (115) | 45541 | 1.3% |
| (Missing) | 3230429 |
Length
| Value | Count | Frequency (%) |
| potag | 32312 | |
| patur | 15859 | 10.4% |
| pin | 12896 | 8.5% |
| parc | 12788 | 8.4% |
| frich | 10053 | 6.6% |
| vaoc | 7562 | 5.0% |
| chat | 4801 | 3.2% |
| imm | 3866 | 2.5% |
| marai | 3475 | 2.3% |
| chene | 3230 | 2.1% |
| Other values (115) | 45541 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 125 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 3230429 |
| Missing (%) | 95.5% |
| Memory size | 25.8 MiB |
| Jardin potager | |
|---|---|
| Pâture plantée | |
| Pins | |
| Parc | |
| Friche | |
| Other values (120) |
Length
| Max length | 38 |
|---|---|
| Median length | 14 |
| Mean length | 12.54715421 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Dépendances d'ensemble immobilier |
|---|---|
| 2nd row | Dépendances d'ensemble immobilier |
| 3rd row | Dépendances d'ensemble immobilier |
| 4th row | Dépendances d'ensemble immobilier |
| 5th row | Dépendances d'ensemble immobilier |
Common Values
| Value | Count | Frequency (%) |
| Jardin potager | 32312 | 1.0% |
| Pâture plantée | 15859 | 0.5% |
| Pins | 12896 | 0.4% |
| Parc | 12788 | 0.4% |
| Friche | 10053 | 0.3% |
| Vins d'appellation d'origine contrôlée | 7562 | 0.2% |
| Châtaigneraie | 4801 | 0.1% |
| Dépendances d'ensemble immobilier | 3866 | 0.1% |
| Pré marais | 3475 | 0.1% |
| Chênes | 3230 | 0.1% |
| Other values (115) | 45541 | 1.3% |
| (Missing) | 3230429 |
Length
| Value | Count | Frequency (%) |
| jardin | 33856 | 12.2% |
| potager | 32312 | 11.7% |
| pâture | 15859 | 5.7% |
| plantée | 15859 | 5.7% |
| pins | 12896 | 4.7% |
| parc | 12792 | 4.6% |
| friche | 10053 | 3.6% |
| ou | 7784 | 2.8% |
| vins | 7718 | 2.8% |
| d'origine | 7562 | 2.7% |
| Other values (159) | 120312 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 46226 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 1079656 |
| Missing (%) | 31.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4134.806089 |
| Minimum | 1 |
|---|---|
| Maximum | 4620522 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 30 |
| Q1 | 237 |
| median | 624 |
| Q3 | 1922 |
| 95-th percentile | 13312 |
| Maximum | 4620522 |
| Range | 4620521 |
| Interquartile range (IQR) | 1685 |
Descriptive statistics
| Standard deviation | 24181.58361 |
|---|---|
| Coefficient of variation (CV) | 5.848299314 |
| Kurtosis | 1653.967431 |
| Mean | 4134.806089 |
| Median Absolute Deviation (MAD) | 499 |
| Skewness | 25.42368337 |
| Sum | 9523103453 |
| Variance | 584748986.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500 | 44352 | 1.3% |
| 1000 | 21616 | 0.6% |
| 800 | 6696 | 0.2% |
| 600 | 6519 | 0.2% |
| 12 | 6029 | 0.2% |
| 400 | 5559 | 0.2% |
| 13 | 5430 | 0.2% |
| 700 | 5308 | 0.2% |
| 2000 | 5168 | 0.2% |
| 200 | 5125 | 0.2% |
| Other values (46216) | 2191354 | |
| (Missing) | 1079656 |
| Value | Count | Frequency (%) |
| 1 | 4983 | |
| 2 | 3976 | |
| 3 | 3726 | |
| 4 | 3814 | |
| 5 | 3977 | |
| 6 | 3829 | |
| 7 | 3552 | |
| 8 | 3731 | |
| 9 | 3486 | |
| 10 | 4469 |
| Value | Count | Frequency (%) |
| 4620522 | 1 | |
| 3058525 | 1 | |
| 3041018 | 1 | |
| 2636576 | 1 | |
| 2633047 | 2 | |
| 2604157 | 1 | |
| 2479458 | 1 | |
| 2445345 | 2 | |
| 1833430 | 1 | |
| 1754953 | 1 |
| Distinct | 1774859 |
|---|---|
| Distinct (%) | 54.1% |
| Missing | 99869 |
| Missing (%) | 3.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.264622777 |
| Minimum | -63.152364 |
|---|---|
| Maximum | 55.828599 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 756135 |
| Negative (%) | 22.4% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | -63.152364 |
|---|---|
| 5-th percentile | -2.210359 |
| Q1 | 0.2140255 |
| median | 2.343244 |
| Q3 | 4.500259 |
| 95-th percentile | 6.582639 |
| Maximum | 55.828599 |
| Range | 118.980963 |
| Interquartile range (IQR) | 4.2862335 |
Descriptive statistics
| Standard deviation | 6.096855988 |
|---|---|
| Coefficient of variation (CV) | 2.692217022 |
| Kurtosis | 70.80281769 |
| Mean | 2.264622777 |
| Median Absolute Deviation (MAD) | 2.136624 |
| Skewness | -1.462984273 |
| Sum | 7434627.493 |
| Variance | 37.17165293 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1.151664 | 2812 | 0.1% |
| -3.917273 | 1340 | < 0.1% |
| -1.133883 | 1287 | < 0.1% |
| -1.148546 | 1137 | < 0.1% |
| 4.33429 | 828 | < 0.1% |
| 5.039083 | 816 | < 0.1% |
| 2.475669 | 612 | < 0.1% |
| 0.628714 | 590 | < 0.1% |
| 2.285987 | 568 | < 0.1% |
| 6.015076 | 568 | < 0.1% |
| Other values (1774849) | 3272385 | |
| (Missing) | 99869 | 3.0% |
| Value | Count | Frequency (%) |
| -63.152364 | 1 | < 0.1% |
| -63.138125 | 1 | < 0.1% |
| -63.138027 | 2 | < 0.1% |
| -63.131398 | 2 | < 0.1% |
| -63.128902 | 4 | < 0.1% |
| -63.115044 | 7 | < 0.1% |
| -63.114977 | 7 | < 0.1% |
| -63.114394 | 1 | < 0.1% |
| -63.112347 | 1 | < 0.1% |
| -63.11152 | 23 |
| Value | Count | Frequency (%) |
| 55.828599 | 1 | < 0.1% |
| 55.827382 | 4 | |
| 55.825482 | 1 | < 0.1% |
| 55.825313 | 1 | < 0.1% |
| 55.824223 | 1 | < 0.1% |
| 55.824207 | 1 | < 0.1% |
| 55.824182 | 1 | < 0.1% |
| 55.822633 | 1 | < 0.1% |
| 55.822018 | 1 | < 0.1% |
| 55.820668 | 1 | < 0.1% |
| Distinct | 1717835 |
|---|---|
| Distinct (%) | 52.3% |
| Missing | 99869 |
| Missing (%) | 3.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46.17557609 |
| Minimum | -21.385366 |
|---|---|
| Maximum | 51.082118 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 15371 |
| Negative (%) | 0.5% |
| Memory size | 25.8 MiB |
Quantile statistics
| Minimum | -21.385366 |
|---|---|
| 5-th percentile | 43.2229215 |
| Q1 | 44.701166 |
| median | 46.727986 |
| Q3 | 48.70185 |
| 95-th percentile | 49.921988 |
| Maximum | 51.082118 |
| Range | 72.467484 |
| Interquartile range (IQR) | 4.000684 |
Descriptive statistics
| Standard deviation | 5.577619068 |
|---|---|
| Coefficient of variation (CV) | 0.1207915426 |
| Kurtosis | 102.0080856 |
| Mean | 46.17557609 |
| Median Absolute Deviation (MAD) | 1.986675 |
| Skewness | -9.126972346 |
| Sum | 151591784.3 |
| Variance | 31.10983446 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 45.412412 | 2812 | 0.1% |
| 47.882578 | 1340 | < 0.1% |
| 45.413709 | 1287 | < 0.1% |
| 45.417461 | 1136 | < 0.1% |
| 43.823295 | 828 | < 0.1% |
| 43.408129 | 816 | < 0.1% |
| 48.926664 | 612 | < 0.1% |
| 44.208483 | 590 | < 0.1% |
| 48.813074 | 569 | < 0.1% |
| 43.714376 | 565 | < 0.1% |
| Other values (1717825) | 3272388 | |
| (Missing) | 99869 | 3.0% |
| Value | Count | Frequency (%) |
| -21.385366 | 1 | < 0.1% |
| -21.385337 | 1 | < 0.1% |
| -21.384688 | 1 | < 0.1% |
| -21.384669 | 1 | < 0.1% |
| -21.384599 | 1 | < 0.1% |
| -21.38458 | 1 | < 0.1% |
| -21.384315 | 2 | |
| -21.384314 | 1 | < 0.1% |
| -21.384179 | 1 | < 0.1% |
| -21.384034 | 4 |
| Value | Count | Frequency (%) |
| 51.082118 | 3 | |
| 51.082045 | 2 | < 0.1% |
| 51.081947 | 6 | |
| 51.081765 | 5 | |
| 51.08171 | 2 | < 0.1% |
| 51.081678 | 2 | < 0.1% |
| 51.081576 | 3 | |
| 51.081375 | 3 | |
| 51.081102 | 3 | |
| 51.080872 | 1 | < 0.1% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.